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While conventional computers must be programmed in a logical fashion by a person who 
thoroughly understands the task to be performed, the motivation behind neural networks is to 
develop machines which can train themselves to perform tasks, using available information about 
desired system behavior and learning from experience. 

Goals of the project begun under the Faculty Summer Fellowship program were threefold: 

1 ) to evaluate various neural net methods and generate computer software to implement those 
deemed most prom ising on a personal computer equipped with Matlab 

2) to evaluate methods currently in the professional literature for system control using neural 
nets to choose those most applicable to control of flexible structures 

3) to apply the control strategies chosen in 2) to a computer simulation of a test article, the 
Control Structures Interaction Suitcase Demonstrator , which is a portable system 
consisting of a small flexible beam driven by a torque motor and mounted on springs tuned to 
the first flexible mode of the beam. 

At the present time, the first two goals have been met, and work on the third is on-going. Results of 
each will be discussed below. 

Using many references, the currently available methods for training neural nets were examined 
and evaluated for ease of implementation, reliability, computer requirements, and applicability to 
control systems. Some methods were rejected because of the vast numbers of neurons required to 
work practical problems (e g. , Bidirectional Associative Memories); some, for example Boltzmann 
machines, because of the very large amount of computer time required to train the nets; and some, 
like Hopfield nets, for the extreme difficulty of implementation ( in order to utilize a Hopfield net, a 
Lyapunov function must be gener ated for system "goodness" and appropr iate weight adjustments 
based on that Lyapunov function must be determined- -a procedure requiring vast “mathematical 
exper tise and ingenuity" [ 1 ] ). While there is currently no optimum method for neural nets, after 
careful evaluation, back -propagation was chosen as the most practical choice for implementation. 
This method changes network weights proportional to the partial derivative of the system error 
function with respect to each weight. This approximates a gradient descent procedure, and therefore 
assures that the system will reach an energy minimum. Difficulties with back -propagation include 
possible network paralysis if neurons saturate, the possibility of reaching a local rather than a 
global minimum , and long training times. However , the method is very easy to implement 
algorithmically, and is used in the majority of the controls applications appearing in the current 
literature. Methods have been proposed to fix difficulties with back -propagation, but each has its 
own associated problems ( for example, Cauchy training eliminates the problem of convergence to 
local minima, but has a greater instance of network paralysis than systems using back -propagation, 
and a training time one hundred times that of the already lengthy back -propagation training). Thus 
back -propagation was chosen as the neural net training method to be implemented. 

Using Matlab . software was generated implementing a back -propagation trained neural net on an 
IBM compatible personal computer. For a given problem, number of layers and number of neurons 
must be "empirically determined," [2] so neural nets of several sizes and configurations were 
compared. Some authors have hypothesized that fewer neurons may be used for a given problem if 
those neurons are arranged in more layers [ 1 ]. In the trials conducted, no network was found which 
failed to converge eventually, so no evidence was obtained to support or disprove this hypothesis. 
However , empir leal evidence does suggest that given that both will eventually converge to a solution , 
a neural net with fewer layers will converge more quickly. Figure 1 , showing the error measure 
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( total sum square error) versus number of training epochs for a two- layer neural net ( with one 
nonlinear hidden layer and a linear output layer) and a three- layer net (with two nonlinear hidden 
layers and a linear output layer), is typical of the results generated. 

In the second phase of the project, recent publications in the professional literature regarding 
applications of neural nets to control problems were examined and compared. Methods currently 
available can be divided into roughly three categories: 

a) methods in which a neural net is trained to emulate a currently existing controller , whether 
human or computerized ( such as [ 3] ) ; 

b) methods in which nets generate some state or function which is then used in a standard 
controller design ( for example, [ A] in which the neural net is used to generate estimates of 
unknown nonlinear system parameters, which are then used In a standard adaptive 
controller); 

c) methods in which the neural net generates a controller for an unknown system without 
human intervention [2]. 

Of the three types, the third is by far the most sophisticated, as it assumes no mathematical 
knowledge of the system to be controlled, and does not require a human to be able to control the 
system or to generate 8 controller which successfully does so. This would mean that nonlinear 
systems which could be modeled poorly, if at all, theoretically could still be successfully controlled 
by a trained neural net. It was decided that such a method would be the best candidate for controlling 
flexible space structures. 

The particular method choosen for application to the test system W8S that in [ 2] . This is a time 
back- propagation system. First, a neural net must be trained to emulate the behavior of the 
unknown system using standard back-propagation methods. This trained emulator is then used to 
train the controller as follows: 

1 ) A time trajectory for system behavior is generated, with the untrained controller 
generating essentially random inputs to the emulator. 

2) The final emulator output is compared to the desired output. 

3) The error is propagated beck through the emulator to generate an equivalent controller 
error, which is used to train the controller. 

A) The process is continued, propagating back through each time step of the trajectory until the 
controller has been trained for all time steps. 

5) Steps 1 -A are repeated for many trajectories. 

Currently on-going is work applying the method in [2] to the test article. The neural net chosen 
for use had one hidden nonlinear layer containing 35 neurons and a linear output layer of 1 0 
neurons to scale the outputs. One problem in implementing the method was difficulty in obtaining 
accurate training data for the CSI Demonstrator ; the final data was generated by Mark Whorton and 
John Sharkee of NASA, using a Matlab simulation of the system. 

Another difficulty encountered was ill-conditioning of the data. Although it was mentioned 
nowhere in the literature, it was discovered that if inputs to the neural net vary by several orders 
of magnitude, as is the case of the Demonstrator, the nonlinear neuron layer soon saturates, so that 
training of that layer comes to a virtual standstill. This causes the nonlinear layer to send the same 
input to the linear layer regardless of the system input, causing the linear weights to grow without 
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bound as they try to adjust to give varying outputs a constant input. This causes the error measure 
to grow without bound. This problem W8S solved by scaling the trajectories of very large system 
states to bring them down to the level of the others and prevent layer saturation. 

It was also discovered that the 8088 PC being used for software development was too slow to be 
practical in training a neural net to emulate the test article; currently the software is being run on 
an 80386 machine with 1 0 ,000 training patterns comprising a single epoch ( a single epoch takes 
approximately 45 minutes on the 80386 and over 24 hours on the 8088). As yet, the emulator has 
not converged to 2ero error, but to a TSS of approximately 4. Figure 2, of TSS versus epoch 
number . shows this convergence. When state trajectories for both the system and the emulator are 
compared, results for different states range from that in state X j where the emulator does not 

adequately follow the system response (Figure 3) to state X3 (Figure 4) in which the two are 

practically identical. Possible reasons for this include an inadequate training set ( i.e. , one which 
does not fully span the state space) and a neural net with an inadequate number of neurons and/or 
layers. Work to perfect the emulator is continuing. At such time as the emulator adequately 
predicts all system states, the controller will be trained as part of an on-going effort during the 
coming academic year. Once the controller is trained, its performance may be compared to the 
currently existing controllers for the system in terms of computation requirements, robustness, 
etc. Other neural net strategies, such as using the neural net as an estimator for system parameters 
needed by standard adaptive controllers, could also be addressed at some future time. 

While neural nets have yet to be fully evaluated as a tool for control of nonlinear or poorly 
modeled systems, they show great potential in this area, and deserve further consideration and study 
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